# XLSR fine-tuning
Ai Light Dance Stepmania Ft Wav2vec2 Large Xlsr 53 V4
Apache-2.0
This model is an automatic speech recognition model fine-tuned on the GARY109/AI_LIGHT_DANCE - ONSET-STEPMANIA2 dataset, based on gary109/ai-light-dance_stepmania_ft_wav2vec2-large-xlsr-53-v3.
Speech Recognition
Transformers

A
gary109
189
0
Wav2vec2 Large Ru Golos
Apache-2.0
A Russian speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on the Sberdevices Golos dataset, supporting 16kHz audio input
Speech Recognition
Transformers Other

W
bond005
1,182
12
Xlrs Best Lm
Apache-2.0
This is an Indonesian automatic speech recognition model based on the XLSR Wav2Vec2 architecture, fine-tuned on a public Indonesian speech dataset.
Speech Recognition
Transformers Other

X
ridhoalattqas
19
1
Wav2vec2 Large Xlsr Es Col Pro Noise
Apache-2.0
A Spanish speech recognition model fine-tuned from jonatasgrosman/wav2vec2-large-xlsr-53-spanish, optimized for Colombian accent and noisy environments
Speech Recognition
Transformers

W
Santiagot1105
18
0
Wav2vec2 Large Xlsr Es Col Pro
Apache-2.0
A Spanish (Colombian accent) speech recognition model fine-tuned based on jonatasgrosman/wav2vec2-large-xlsr-53-spanish
Speech Recognition
Transformers

W
Santiagot1105
20
0
Wav2vec2 Large Xlsr Es Col Test
Apache-2.0
This is a speech recognition model fine-tuned on a specific dataset based on jonatasgrosman/wav2vec2-large-xlsr-53-spanish model, supporting Spanish.
Speech Recognition
Transformers

W
Santiagot1105
30
1
Wav2vec2hindiasr
Apache-2.0
Hindi automatic speech recognition (ASR) model based on Wav2Vec2 architecture, fine-tuned on public speech datasets
Speech Recognition
Transformers

W
SAGAR4REAL
31
1
Wav2vec2 Large Xlsr 53 English
Apache-2.0
An English speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained on the Common Voice 6.1 dataset
Speech Recognition English
W
jonatasgrosman
251.78k
471
Wav2vec2 Large Xlsr 53 Slovenian
Apache-2.0
This is a Slovenian automatic speech recognition model fine-tuned from Facebook's wav2vec2-large-xlsr-53 model, trained on the Common Voice dataset with a word error rate of 36.04%.
Speech Recognition Other
W
anton-l
15.02k
0
Wav2vec2 Large Xlsr Kazakh
Apache-2.0
This is a Kazakh automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on the Kazakh speech corpus v1.1 with a test WER of 19.65%.
Speech Recognition Other
W
aismlv
12.08k
17
Wav2vec2 Large Xlsr Kyrgyz
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Kyrgyz Common Voice dataset, based on the facebook/wav2vec2-large-xlsr-53 model.
Speech Recognition Other
W
iarfmoose
22
2
Wav2vec2 Large Xlsr 53 Ukrainian
Apache-2.0
A Ukrainian automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on the Common Voice dataset.
Speech Recognition Other
W
anton-l
21
1
Wav2vec2 Large Xlsr 53 Breton
Apache-2.0
A Breton fine-tuned speech recognition model based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Other
W
mrm8488
26
0
Wav2vec2 10july
Apache-2.0
This is a German automatic speech recognition model based on the XLSR Wav2Vec2 architecture, fine-tuned on the Common Voice German dataset.
Speech Recognition
Transformers German

W
sourabharsh
24
0
Wav2vec2 Large Xlsr 53 Hungarian
Apache-2.0
This is a Hungarian automatic speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice dataset.
Speech Recognition Other
W
anton-l
17
0
Wav2vec2 Large Xlsr 53 Eu
Apache-2.0
A Basque automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, achieving a 15.34% word error rate (WER) on the Common Voice Basque test set.
Speech Recognition
Transformers Other

W
pcuenq
1,378
0
Wav2vec2 Large Xlsr Turkish Artificial
Apache-2.0
This is a Turkish speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained using artificial Common Voice dataset.
Speech Recognition Other
W
cahya
25
1
Wav2vec2 Large Xlsr Cnh
Apache-2.0
A Hakha Chin speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained on the Common Voice dataset with a test WER of 31.38%.
Speech Recognition Other
W
gchhablani
22
0
Wav2vec2 Large Xlsr 53 Polish
Apache-2.0
XLSR-53 large model speech recognition system optimized for Polish, fine-tuned based on facebook/wav2vec2-large-xlsr-53, supports Polish automatic speech recognition
Speech Recognition Other
W
jonatasgrosman
412.13k
11
Wav2vec2 Large Xlsr Estonian
Apache-2.0
This is an Estonian automatic speech recognition (ASR) model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice dataset.
Speech Recognition Other
W
m3hrdadfi
26
0
Wav2vec2 Large Xlsr 53 Irish
Apache-2.0
A speech recognition model fine-tuned for Irish language using the Common Voice dataset, based on facebook/wav2vec2-large-xlsr-53.
Speech Recognition
W
cpierse
22
0
Wav2vec2 Large Xlsr Hindi Commonvoice
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53 on the common_voice dataset, primarily used for Hindi speech recognition tasks.
Speech Recognition
Transformers

W
nikhil6041
17
0
Greek Lsr 1
Apache-2.0
An automatic speech recognition model fine-tuned on Greek language based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
Transformers Other

G
skylord
17
0
Wav2vec2 Large Xlsr 53 Demo Colab
Apache-2.0
This model is a speech recognition model fine-tuned on the common_voice dataset based on facebook/wav2vec2-large-xlsr-53, primarily used for robust speech event recognition.
Speech Recognition
Transformers

W
emre
16
0
Wav2vec2 Large Xlsr 53 Latvian
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the Latvian Common Voice dataset based on Facebook's Wav2Vec2-Large-XLSR-53 model.
Speech Recognition Other
W
anton-l
137
1
Wav2vec2 Large Xlsr 53 Rm Vallader
Apache-2.0
A fine-tuned speech recognition model for the Romansh Vallader dialect based on facebook/wav2vec2-large-xlsr-53, achieving a word error rate of 32.89%
Speech Recognition
W
anuragshas
58
0
Xlsr Indonesia
Apache-2.0
Indonesian automatic speech recognition (ASR) model fine-tuned on the XLSR architecture, trained on the Common Voice Indonesian dataset
Speech Recognition
Transformers Other

X
acul3
23
0
Wav2vec2 Large Xlsr Nahuatl
Apache-2.0
A Nahuatl (ncj dialect) speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
Transformers

W
tyoc213
18
1
Wav2vec2 Large Xlsr Georgian
Apache-2.0
Georgian automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input
Speech Recognition
Transformers Other

W
xsway
14.80k
1
Wav2vec2 Large Xlsr Mongolian
Apache-2.0
An automatic speech recognition model fine-tuned on the Mongolian Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Other
W
manandey
4,719
0
Wav2vec2 Large Xlsr Javanese
Apache-2.0
A Javanese automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on high-quality Javanese TTS data from OpenSLR.
Speech Recognition Other
W
cahya
659
0
Wav2vec2 Large Xlsr Sundanese
Apache-2.0
A Sundanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on high-quality TTS data from OpenSLR
Speech Recognition Other
W
cahya
339
0
Wav2vec2 Large Xlsr Hungarian
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the Hungarian Common Voice dataset, based on the facebook/wav2vec2-large-xlsr-53 model.
Speech Recognition Other
W
birgermoell
31
1
Wav2vec2 Large Xlsr 53 Tatar
Apache-2.0
An automatic speech recognition model fine-tuned on Tatar language based on facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input.
Speech Recognition Other
W
crang
163
1
Wav2vec2 Large Xlsr Coraa Portuguese Cv8
Apache-2.0
A Portuguese speech recognition model fine-tuned on the Common Voice dataset based on Edresson/wav2vec2-large-xlsr-coraa-portuguese
Speech Recognition
Transformers

W
lgris
34
0
Wav2vec2 Large Xlsr 53 Kyrgyz
Apache-2.0
This is a Kyrgyz automatic speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained using public speech datasets.
Speech Recognition Other
W
anton-l
32
0
Wav2vec2 Large Xlsr Punjabi
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on Punjabi speech data based on the facebook/wav2vec2-large-xlsr-53 model.
Speech Recognition
W
manandey
20.46k
1
Wav2vec2 Large Xlsr Arabic
Apache-2.0
A speech recognition model fine-tuned on the Arabic Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
Transformers Arabic

W
othrif
302
0
Wav2vec2 Large Xlsr 53 Turkish
Apache-2.0
A Turkish speech recognition model fine-tuned on the Common Voice dataset based on Facebook's wav2vec2-large-xlsr-53 model
Speech Recognition Other
W
aniltrkkn
68
0
Wav2vec2 Large Xlsr 53 Ia
Apache-2.0
An Interlingua speech recognition model fine-tuned from Facebook's wav2vec2-large-xlsr-53 model, achieving a 22.08% word error rate on the Common Voice Interlingua dataset.
Speech Recognition Other
W
anuragshas
28
0
- 1
- 2
- 3
Featured Recommended AI Models